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Abstract 


The charge asymmetry in the production of top quark and antiquark pairs is mea¬ 
sured in proton-proton collisions at a center-of-mass energy of 8 TeV. The data, cor¬ 
responding to an integrated luminosity of 19.6 fb^^, were collected by the CMS ex¬ 
periment at the LHC. Events with a single isolated electron or muon, and four or 
more jets, at least one of which is likely to have originated from hadronization of a 
bottom quark, are selected. A template technique is used to measure the asymmetry 
in the distribution of differences in the top quark and antiquark absolute rapidities. 
The measured asymmetry is = [0.33 ± 0.26 (stat) ± 0.33 (syst)]%, which is the most 
precise result to date. The results are compared to calculations based on the standard 
model and on several beyond-the-standard-model scenarios. 
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1 Introduction 

The top quark is the heaviest particle in the standard model (SM) and the only fermion with a 
mass on the order of the electro weak scale ITJ. Deviation of its production or decay properties 
from the SM predictions could signal physics beyond the SM. Several proposed extensions of 
the SM include heavy mediators of the strong interaction with axial coupling to quarks, col¬ 
lectively referred to as axigluons |2J. Top quark pair production in axigluon-mediated quark- 
antiquark armihilation can exhibit a forward-backward asymmetry that depends on the in¬ 
variant mass of the system, similar to the asymmetry in fermion pair production mediated 
by Z bosons |l3l. These types of models have been leading candidates for accommodating the 
behavior of tf production in proton-antiproton collisions observed by FNAL Tevatron experi¬ 
ments based on about half of their full data set (5 fb^^) ||ll|5l. Since analyses of the full Tevatron 
data set (10fb^^) indicate smaller values of asymmetry lUIZl/ and since recently improved 
SM-based theoretical calculations lH O predict higher values of the asymmetry than previous 
calculations, the discrepancy between the SM and experimental observations has been greatly 
reduced. Measurements of dijet production IITOtiT^ have constrained the range of axigluon 
masses and couplings lH^ . but the constraints are not applicable to models in which axigluon- 
produced dijet resonances are much broader than the experimental resolution, or which include 
multiparticle final states lll^ . Precise measurement of the charge asymmetry in top quark pair 
production remains one of the best ways to test the limits of validity of SM predictions. 

Experiments at the CERN LHC have reported values of charge asymmetry in top quark pair 
production consistent with SM predictions El 13. Corroboration of results from exper¬ 

iments at the Tevatron using measurements at the LHC is complicated by several differences 
between the two colliders. Eirst, while at the Tevatron the majority of the tt events are pro¬ 
duced via quark-antiquark annihilation, at the LHC the tf production is dominated by charge- 
symmetric gluon fusion, gg —)■ tf. Second, collisions at the LHC are forward-backward sym¬ 
metric, so observation of a charge asymmetry in tf production via annihilation of a valence 
quark and a sea antiquark, qq —> tf, relies on the statistical expectation that the system be 
boosted in the direction of the quark momentum. Any difference in top quark and antiquark 
affinity for the initial quark or antiquark momentum will consequently result in more forward 
production of one and more central production of the other. This forward-central tf charge 
asymmetry at the LHC is diluted relative to the forward-backward tf charge asymmetry at the 
Tevatron since the LHC colliding system does not always have a boost in the expected direc¬ 
tion. Third, a significant portion of LHC tf events are due to (anti) quark-gluon initial states, 
qg (qg), which are charge asymmetric in number density as well as momentum, and which 
also contribute to the final-state forward-central tf asymmetry. Despite these complications, 
the large number of tf events produced at the LHC makes measurement of charge asymmetry 
competitive with the Tevatron measurements as a test of the SM. 

The measurement of tf asymmetry presented in this paper utilizes a template technique based 
on a parametrization of the SM. The technique differs from previous tf asymmetry measure¬ 
ments llHZl Il5] - ll3 . which are based on unfolding the effects of selection and resolution in the 
observable distribution. Reference |[l3 in particular analyzes the same data set, but also dif¬ 
fers in selecting fewer events with higher purity as a result of more restrictive jet transverse 
momentum criteria, and in the methods used to reconstruct tf kinematics and determine the 
sample composition. 

The template technique is presented in Section]^ Data from proton-proton collisions at = 
8TeV were collected in 2012 by the CMS experiment, described in Section]^ Event selection, 
reconstruction of tf kinematics, and a population discriminant are described in Section The 
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2 Analysis strategy 


details of the model used to obtain the result are given in Section]^ and the result is presented 
in Section]^ The analysis is summarized in Section]^ 


2 Analysis strategy 


Charge asymmetry in tf production can be defined for an observable X that changes sign under 
the exchange t o f. If X is distributed with a differential cross section dtr/dX, its probability 
density is 


p{X) = 


1 dcr 
crdX' 


( 1 ) 


This can be expressed as a sum of symmetric (p+) and antisymmetric {p ) components. 


p^{X) = [p{X)±p{-X)]/2. 


( 2 ) 


Statistical kinematic differences between top quarks and antiquarks can be summarized in a 
charge asymmetry. 


Af = f^p{X)dX- r p(X)dX = 2 f^p-{X)dX, (3) 

Jo J-x Jo 

where the observable's maximum value X may be finite or infinite. Previous LHC analyses lllSl - 
[m defined a tf charge asymmetry Ac, based on the difference in absolute rapidities of the top 
quark (i/t) and antiquark (y^), 

Alyltt = lyt| - lytl- (4) 

For the technique described in this paper, it is desirable that the observable X be bounded. The 
hyperbolic tangent is a symmetric and monotonic function, so the transformed observable 

Ytt = tanhAlyltt, (5) 

has the asymmetry A^ and is also bounded. 

Charge asymmetries at production can only be determined from observed data distributions 
using an extrapolation based on a particular model. Past measurements were extrapolated 
using an unfolding technique, which relies on a model for the selection efficiencies and recon¬ 
struction effects An alternative extrapolation discussed in this paper uses a model 

to derive template distributions for the symmetric and antisymmetric components, p^. 

In the present analysis, the next-to-leading-order (NLO) powheg event generator (version 
1.0) II 20 II is used in association with the CTIO ETI parton distribution functions (PDFs) as a 
base model to construct the symmetric and antisymmetric components of the probability den¬ 
sity p(X) for an observable X. These distributions are represented as symmetrically binned 
histograms, given as vectors with a dimensionality equal to the number of bins. A general¬ 
ized model with a single parameter a can be constructed from a linear combination of the base 
model components, 

_ £+ 

The measurement strategy is to find the value of a that best fits the observations. The base 
model charge asymmetry is given by Eq. (j^. The charge asymmetry observed in data is 
then equal to that of the base model scaled by the parameter a: 

Af («) = aAl 


(7) 
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Table 1: The tt initial-state fractions and charge asymmetries in the observable Yjj, calculated 
with POWHEG using the CTIO PDFs. The statistical uncertainty in the last digit is indicated in 
parentheses. 


Initial state 

Eraction (%) 

(%) 

gg 

65.2 

-0.06(3) 

qq 

13.4 

2.95(6) 

qg 

18.2 

1.17(5) 

qg 

3.2 

-0.2(1) 

pp 

100.0 

0.56(2) 



Figure 1: The (left) symmetric and (right) antisymmetric components of the binned prob¬ 
ability distributions in the observable Yjj, constructed using POWHEG Il20l with CTIO PDFs Il2l1 . 
for tt production from gg, qq, qg, and qg initial states. 


Figure presents the distributions in gg, qq, and qg (qg) initial states for X = Yjj, before 
the event reconstruction and selection are applied, and the composition and intrinsic charge 
asymmetries of each initial state are listed in Table Imperfect detector resolution, event re¬ 
construction, and selections can result in distributions of the reconstructed observable Yt®*^ that 
differ from those in Y^j. For this reason, the symmetric and antisymmetric templates, are 
constructed using POWHEG-generated events that are fully reconstructed and pass the selec¬ 
tion criteria. Studies of simulated events show that there is no significant constant bias in the 
charge asymmetry caused by event reconstruction and selection. Thus, the scale parameter a 
in Eqs. can be determined by a fit to the reconstructed distribution in data. 


®data 


= X ' 


XX,, 


( 8 ) 


3 CMS detector and definition of physics objects 

The central feature of the CMS apparatus is a superconducting solenoid of 6 m internal diame¬ 
ter, providing a magnetic field of 3.8 T. Within the solenoid volume are a silicon pixel and strip 
tracker, a lead tungstate crystal electromagnetic calorimeter (ECAL), and a brass and scintilla¬ 
tor hadron calorimeter (HCAL), each composed of a barrel and two endcap sections. Muons 
are measured in gas-ionization detectors embedded in the steel flux-return yoke outside the 
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3 CMS detector and definition of physics objects 


solenoid. Extensive forward calorimetry complements the coverage provided by the barrel 
and endcap detectors. 

The first level of the CMS trigger system, composed of custom hardware processors, uses in¬ 
formation from the calorimeters and muon detectors to select the most interesting events in a 
fixed time interval of less than 4 ^s. The high-level trigger processor farm further decreases 
the event rate from around 100 kHz to around 400 Hz, before data storage. Single-electron and 
single-muon triggers were used to collect events for this analysis. 

The particle-flow event algorithm Il22l 1231 is used to reconstruct and identify each individual 
particle with an optimized combination of information from the various elements of the CMS 
detector. Photons and electrons are defined as clusters in ECAL with a requirement that there 
be a charged-particle trajectory pointing to an electron cluster. The energy of a photon is di¬ 
rectly obtained from the ECAE measurement, corrected for zero-suppression effects. The en¬ 
ergy of an electron is determined from a combination of the electron momentum at the primary 
interaction vertex as determined by the tracker, the energy of the corresponding ECAE cluster, 
and the energy sum of all bremsstrahlung photons spatially compatible with originating from 
the electron track 1241 . The momentum of a muon is obtained from the direction and curvature 
of its combined trajectory in the muon and tracking systems. The energy of a charged hadron 
is determined from a combination of its momentum measured in the tracker and the matching 
ECAE and HCAE energy deposits, corrected for zero-suppression effects and for the response 
function of the calorimeters to hadronic showers. Einally, the energy of a neutral hadron is 
obtained from the corresponding corrected ECAE and HCAE energy deposits. 

Eor each event, after identification and removal of leptons relevant to the sample selection and 
particles from additional proton-proton interactions within the same bunch crossing (pileup), 
hadronic jets are clustered from these reconstructed particles with the infrared- and collinear- 
safe anti-kx algorithm, operated with a size parameter R of 0.5 12511 . The jet momentum is 
determined as the vectorial sum of all particle momenta in this jet, and is found in the sim¬ 
ulation to be within 5 to 10% of the true momentum over the whole transverse momentum 
(px) spectrum and detector acceptance. Jet energy corrections are derived from the simulation, 
and are confirmed with in situ measurements of the energy balance of dijet and photon+jet 
events l26l . The jet energy resolution amounts typically to 15% at lOGeV, 8% at 100 GeV, and 
4% at 1 TeV. An offset correction is applied to jet energies to take into account pileup contribu¬ 
tions. Additional selection criteria are applied to each event to remove spurious jet-like features 
originating from isolated noise patterns in certain HGAE regions. Jets from b quarks are iden¬ 
tified using a discriminant containing information about secondary vertices formed by at least 
three charged-particle tracks, including the number of associated tracks, the displacement from 
the collision point, and the vertex mass, which is computed from the tracks associated with the 
secondary vertex 12711 . 

The missing transverse momentum vector is defined as the projection on the plane per¬ 
pendicular to the beams of the negative vector sum of the momenta of all reconstructed parti¬ 
cles in an event. Its magnitude is referred to as 

A more detailed description of the GMS detector, together with a definition of the coordinate 
system used and the relevant kinematic variables, can be found in Ref. Ii28| . 
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4 Event selection and reconstruction 

Each event is considered under the hypothesis that a top quark and a top antiquark each decay 
into a bottom quark and a W boson, and that one W boson subsequently decays into a pair of 
quarks, while the other decays into a neutrino and either an electron or a muon, producing a 
lepton and jets (^+jets) signature. 

Events are selected from data collected from collisions of protons at 8 TeV center-of-mass energy 
and corresponding to an integrated luminosity of (19.6 ± 0.5) fb^^ i29|. Selected events contain 
at least four jets each with \rj\ < 2.5 and pj > 20GeV, one isolated electron (muon) with 
\r]\ < 2.5 (2.1) and px > 30 (26) GeV. Events are also required to have no other electrons 
(IpI < 2.5, Px > 20GeV) or muons (|p| < 2.5, px > 10GeV). A selected event must have 
an electron with a particle-flow relative isolation Jpp less than 0.1, or a muon with Ipp less 
than 0.12 1^ l30l . Events containing an electron with 0.11 < Ipp < 0.15 or a muon with 
0.13 < Ipp < 0.20 are retained as a control, or sideband, region. The (next-to-) leading jet must 
have Px > 45 (35) GeV. At least one jet must be b-tagged, as defined by the medium working 
point of the combined secondary vertex b tagging discriminant (GSV), which has an efficiency 
better than about 65% and a misidentification probability of about 1.5% Il27ll . In total, 326185 
events are accepted with an electron and jets in the final state, hereafter referred to as the e+jets 
channel, and 340 911 events are accepted in the p+jets channel. 

In addition to tt production, several other processes can produce a £+jets signature that passes 
this selection. In particular, these processes include production of leptonically decaying W 
bosons in association with jets (Wj), Drell-Yan (DY) production of pairs from qq annihi¬ 
lation in association with jets and in which one lepton is not identified, and the production of 
single top (St) quarks accompanied by additional jets. Production of quantum chromodynamic 
multijets (Mj) also contributes to the background. Such events can satisfy the selection if a jet 
is misidentified as an electron or if a muon produced in the decay of a heavy quark passes the 
isolation criteria. 

More than 65% of selected events contain tt pairs. 

4.1 Modeling of signal and background 

The detection of generated particles is fully simulated with the Geant4 software l3TI using a 
detailed description of the GMS detector. The samples account for the observed multiplicity of 
pileup interactions in data. Additional weights are applied after event selection to match the ef¬ 
ficiency of triggers and object identification that are measured in a data sample of Z+jets events 
using a tag-and-probe method Il24ll30ll . The energy difference between each reconstructed jet 
and its corresponding generated jet is scaled to match the {p- and px'dependent) jet energy 
resolution in data, as measured using the dijet asymmetry technique 8261 . 

As mentioned, the tt events are generated with the NLO powheg heavy-quark pair production 
algorithm, using the GTIO PDEs, and interfaced with PYTHIA (version 6.426) for parton show¬ 
ering and hadronization Il32lj34l . Events with W or Z bosons in conjunction with 1, 2, 3, or 4 
jets are generated with leading-order (LO) MadGraph (version 5.1.3.30) Ii35l . also interfaced 
with PYTHIA. A dedicated W+bb sample is used for investigation of systematic uncertainties. 
Events with single top quarks or antiquarks are generated with POWHEG in the s and t chan¬ 
nels [361 , and are generated in the tW charmel using diagram removal rather than the diagram 
subtraction method [37 l. 

The Mj background has a very low efficiency to pass the selection, making it difficult to simulate 
enough selected events, but it has a large enough cross section to make it significant. The Mj 
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4 Event selection and reconstruction 


background is modeled using the sideband data, subtracting the contributions of simulated 
processes, which are normalized according to the integrated luminosity and their cross sections 
and selection efficiencies. 

Several alternative models of tf production are used to investigate systematic uncertainties and 
to evaluate the performance of the method. Alternative SM tf simulations are generated with 
MadGraph and with MC@NLO (version 3.41) 11381 using the CTEQ6 PDFs (versions LI and 
M, respectively) II391I . Systematic uncertainties related to the factorization and renormalization 
scales are evaluated using powheg tf samples in which both scales are increased or decreased 
simultaneously by a factor of two from their nominal values, equal to the event momentum 
transfer squared; these control samples are processed with the FastSim 1401 simulation of the 
CMS detector. A set of six models in which tf production kinematics are modified by the pres¬ 
ence of new physics are generated with MadGraph, and are described in detail in Ref. IITSl . 
The models are chosen to have parameters not yet excluded by other experimental constraints. 
The set includes a model with an added complex gauge boson Z' IllTl with a mass of 220 GeV 
and a coupling to right-handed up-type quarks. Other models in the set include parametrized 
color-octet vector bosons (axigluon) models |2|, in which the axigluon has nonzero mass and 
chiral couplings. Three models include a light axigluon with a 200 GeV mass and coupling 
characterized as right, left, or axial. Two models include a heavy axigluon with a 2 TeV mass 
and right or axial coupling. 


4.2 Reconstruction of top quarks 


Top quarks are reconstructed using the most likely assignment of the reconstructed jets to the 
tf decay partons. Jet four-momenta are corrected according to their parton assignment and a 
kinematic fit, which uses the known top quark and W boson masses ll]. The neutrino momen¬ 
tum is calculated analytically Il42l . The top quark and antiquark four-momenta are found by 
summing the four-momenta of their respective decay products. The charge of the leptonically 
decaying top quark is determined by that of the electron or muon, while the top quark that 
decays into jets is assumed to be of the opposite charge. 

All jet assignments are considered in selecting the assignment of maximum likelihood. The 
selection ensures that the number of jets in the event Nj is at least four. There are Nc = 
\Nj\/{Nj — 4)!, or a minimum of 12, possible jet assignment combinations. Each assignment 
is represented by a tuple {a, b, c, d, {x}), where a represents the b jet associated with t —> hlv^ 
decay, h represents the b jet associated with t —> bqq decay, c and d represent the two jets from 
hadronic W boson decay, ordered by ipj, and {v} represents any additional jets in the event, 
ordered by px- The correct assignment in simulation is designated {d, b, c, d, {x}). 


The scale factors for correcting the energy of the jets from the reconstruction to the parton 
level are obtained from tf simulation, following the event selection, for b jets from top quark 
decay, jets from W boson decay, and other jets. Gorrections are found as a function of pj in 
three bins of absolute pseudorapidity, with upper bin boundaries at |p| = 1.131, 1.653, and 
2.510, corresponding to the calorimeter barrel, transition, and endcap regions. The corrections, 
shown in Fig.|^ are applied to the measured jet energies according to the assignment. 

The likelihood of a given jet-to-parton assignment i is 


Li 




(9) 


where is the likelihood of the jet b tagging discriminants, is the likelihood ratio 

of the invariant masses of jet combinations associated with t —> bqq decays, and LRf is the 
likelihood ratio of the x.^ associated with the products from t —> hiv£ decays. 
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Figure 2: The median value of the logarithm of the ratio of parton energy to measured energy, 
as a function of measured px in three bins of \r]\, for (left) b jets from top quark decay, (center) 
jets from W boson decay, and (right) other jets. 

The CSV b tagging discriminant associates a value fi with each jet. The conditional CSV prob- 
ability densities B = p{fi\a,b), Q = p{j5\c,d), and J\f = p(/l|{x}) are shown in Fig. The 
likelihood of a given jet assignment i, considering the associated CSV values {j}}, is 

Lpv = Bifia)B{fi,)Qific)Q{M n (10) 



Figure 3: The conditional probability densities of the CSV b tagging discriminant from simu¬ 
lation for jets from b quarks, jets from W boson decay, and other jets. 


The jet invariant masses associated with t ^ bqq decays are mj,cd and nicd, with parton-level 
jet corrections applied based on the assignment. Their two-dimensional probability distribu¬ 
tion for correct assignments is shown in Fig. The mean and variance of this distribution are 
calculated after removing the tail of the distribution, defined as the lowest-valued bins which 
integrate to a 1% probability, in order to find a Gaussian approximation. Contours of the ap¬ 
proximation, in standard deviations, are also shown in Fig. The distance of a point from 
the center of this Gaussian function, expressed in units of standard deviations, is denoted by 
"mass standard deviations" (MSD). Probability distributions in MSD for correct and incorrect 
assignments, and their ratio are shown in Fig. 
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Figure 4: The two-dimensional probability density from simulation of jet invariant masses 
from W boson and top quark decay is shown (top), along with contours in stan¬ 

dard deviations (MSD) of the corresponding Gaussian approximation. Probability densities for 
correct and incorrect jet assignments (middle) are shown (left) for MSD and (right) for xl 
of the leptonically decaying top quark reconstruction. The corresponding likelihood ratios are 
shown below. 
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The momentum of the neutrino associated with the leptonically decaying top quark is calcu¬ 
lated according to Ref. Il42l using and the four-momenta of the charged lepton and jet a. 
Correct and incorrect assignments of jet a are discriminated using the test statistic 




2 

a 


■d. 


( 11 ) 


where cr^ is the covariance matrix for pjl^'®®, derived from the momentum uncertainties of the 
reconstructed objects in the event, and d is the difference vector in the transverse plane between 
p^®® and the neutrino momentum solution. The distributions of the square root of Xa 
correct and incorrect assignments of jet a, and their ratio LR^, are shown in Fig.|^ 

Of the selected tt events, about half contain reconstructed jets corresponding to all four tt decay 
partons. In about 60% of those events, the assignment with the maximum likelihood is also the 
correct assignment. 


4.2.1 Kinematic fitting procedure 


The energy resolution of jets corresponding to the most probable assignment can be improved 
beyond the intrinsic resolution of the CMS detector using the constraints from the masses of the 
top quark and W boson. These constraints are applied in two stages. First, jet four-momenta 
Pi are scaled to p, = (1 -|- 3i)pi with the free parameters 5i, for i equal to h, c, or d, in the 
minimization of the test statistic 


/ mw-nicd V' 

V Fw/2 J 


/ nit - mcd 

V Ft/2 



2 


( 12 ) 


Here, r, are the pj- and //-dependent relative jet energy resolutions (Te/E, and mcd and m},cd 
are the invariant masses calculated with the scaled jet four-momenta. The mass and width 
parameters used for the W boson and top quark are: nit/q = 80.4 GeV; nit = 172.0 GeV; Fw = 
2GeV; and Ft = 13 GeV. The values of Ft and Fw represent the empirical resolution of the 
reconstructed particle masses for a single event, rather than the natural particle widths. The 
momentum and energy of the top quark that decays into jets are given by pi- In the 

second stage, the four-momentum of jet a is scaled to p^ = (1 -|- 6a)pa with the free parameter 
5a, to minimize the test statistic x^ from Eq. ( |TT| . At each step of this minimization, is 
calculated with the charged-lepton four-momentum, the candidate pa, and pjF^®® corrected for 
the scaling of the a, h, c, and d jets. The uncertainty in the corrected pjF'®® is reduced from that 
of the nominal reconstruction by removing a portion of the uncertainty corresponding to the 
energies of the a, b, c, and d jets. The neutrino momentum associated with the minimized xl is 
summed with the corresponding pa and the charged lepton four-momentum to find the energy 
and momentum of the leptonically decaying top quark. 


4.3 Discrimination among three popuiations 

To measure the sample composition in the data after the event selection, we construct a like¬ 
lihood discriminant designed to distinguish among populations of events from three leading 
processes: tf, Mj, and Wj, denoted respectively by Pi, P 2 , and P 3 in the following generalized 
construction. As will be discussed in Section the contributions from St and DY are con¬ 
strained to those predicted by their SM cross sections. The likelihood that an event belongs 
to population P is Lp = ]/[/ i^i), where {Vi} is a set of random variables with probability 

densities if. For independent {Vi}, the likelihood ratio Lp^/Lp.^ is more discriminating than 
any single constituent variable Il4^ . One can construct a likelihood-ratio-based discriminant 

A = Arg (Lp^ + e2'^/3^p^ + /tt, (13) 
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5 Measurement procedure 


the principal value of which is bounded periodically on (- 1 , 1 ] and is symmetric under exchange 
of any two of the three populations. Figure illustrates the construction. Populations Pi, P 2 , 
and P 3 tend to concentrate at A of 0, 2/3, and —2/3, respectively. 



Figure 5: The angle ttA of the resultant sum of three vectors spaced at equal angles, in which 
the magnitude of each is the likelihood of the respective population. The dashed arrows are 
translations of the and vectors which illustrate the construction of the sum. The 

circle is shown to indicate the relative scale. 


Three observables are u sed to construct the likelihoods for the discriminant. The first is the 
transverse mass Mj = V2£tEt'^®® (1 — cos(p), where is the magnitude of the charged lepton 
Pt, (p is the azimuthal angle between the charged lepton momentum and and is the 
magnitude of The second is the probability from the MSD that at least one jet assignment 
is the correct one, defined as Pmsd = (^c + where Nc and are 

defined in Section [4^ The third is the probability from the CSV b tagging discriminant that at 
least one jet assignment is the correct one, defined as 


Pcsv = 


eELpv 

eELpv + (i_e)]v,nye{jets}W;)' 


(14) 


where Pp^ and J\f are defined in and before Eq. ( [l^ , and the prior probability that at least one 
assignment is correct is set to e = 0.05. A value of c = 0.05 is chosen because it results in a 
more balanced distribution of Pcsv than, for example, a flat prior with e = 0.5. We found these 
observables to be highly discriminating and mostly independent of each other. 


The probability distribution for each population is shown as a function of the discriminant 
and each of its input observables in Fig. The Mj probability distributions for the inputs are 
calculated using fixed SM cross sections, as determined by the simulations, for the subtracted 
tf and Wj contributions. 


5 Measurement procedure 

A two-stage maximum-likelihood fit is employed to sequentially measure the sample compo¬ 
sition, using the A distribution, and the charge asymmetry, using the Y^''p distribution. 

The sample composition is determined independently for each lepton channel by fitting a 
model to the observed distribution Nf in the discriminant A. Normalized five-bin templates 
in A are constructed from the selected events for each of the simulated processes, including tf, 
Wj, St, and DY, in both the signal and sideband regions. The total number of events expected 
in each region from simulated process j is the product of the integrated luminosity C, the cross 
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19.6fb ' (8TeV) 



A 







Figure 6: The probability distribution of the discriminant A for (top left) selected e+jets events 
and (top right) selected ;^+jets events, for the simulated Wj and tf populations, and for the Mj 
population, which is modeled from the sideband data with simulated contributions subtracted. 
The probability distributions in each observable used to construct the discriminant are shown 
for (middle) e+jets and (bottom) /^+jets channels. The overflow is included in the rightmost bin 
of the Mt distributions. 
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5 Measurement procedure 


section aj, and the selection efficiency. The selection efficiencies are taken directly from simu¬ 
lation. Each cross section is parametrized by the relative change Sj from the nominal value aj. 
The integrated luminosity is parametrized by the relative change 5c from the measured central 
value. The Mj distribution in A is determined at each iteration of the fit by subtracting the side¬ 
band contributions of simulated processes from the sideband region in data, and then rescaling 
this distribution by a positive parameter F^y The total number of expected events in each bin, 
A|, is the sum of the expected contributions from the tt, Wj, Mj, St, and DY processes. Parame¬ 
ters 5c, 5st, and (^dy are held fixed to zero or to nonzero values when investigating systematic 
uncertainties. The sample composition is determined by finding values of the free parameters 
i^M)'^My that maximize the product of the Poisson likelihoods over the bins, given 

observations N- and expectations Af. The fit is implemented using RooPlT 1441 . 

The charge asymmetry is determined from a fit to the five-bin distribution in based on the 
same model. With the sample composition parameters held fixed, and following Eq. Q, the 
POWHEG tf model is extended by introducing a new free parameter a. to provide changes in the 
relative magnitudes of the symmetric and antisymmetric components of YT‘^, shown in Eig. 
The difference in shape of the e+jets and /^+jets templates is a result of the different rapidity 
coverage between the two lepton flavors. The modeled charge asymmetry is that of the tf base 
model, Ac, scaled by a, 

A? = aAl (15) 

The charge asymmetry in the data is estimated by finding the value of a. that maximizes the 
product of the Poisson likelihoods over the bins. The results from the independent measure¬ 
ments in both lepton charmels are combined before evaluating the systematic uncertainties. 



Pigure 7: The (left) symmetric and (right) antisymmetric components of the probability 
distribution for selected tf simulation events in the e+jets and f/+jets channels. The vertical 
bars show the statistical uncertainties, while the horizontal bars display the bin widths. 


5.1 Performance and calibration 

The performance of the method is checked on simulated samples constructed using tf events 
based on the extended POWHEG model as well as the alternative tf simulations described in 
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Section 4.1 The extended powheg model is checked using various values of the parameter 


a. by measuring pseudo-experiments generated with Poisson variations of the best-fit model, 
mimicking fluctuations expected in data. The statistical uncertainty measured in 68% of the 
pseudo-experiments is greater than the absolute difference between the measured and expected 
values. The distribution in statistical uncertainty in with an expected value of 0.258%, is 
shown in Fig. 



Figure 8: The distribution of the statistical uncertainty in from measurements using pseudo¬ 
experiments, with an expected value of 0.258%. The statistical uncertainty extracted from the 
data is marked by the arrow. 


The alternative tf simulations are checked using pseudo-experiments with the sample com¬ 
position of the measured data, constructed with fixed background and Poisson-varied signal 
templates, to find the uncertainty from the sample statistics of each alternative model. Figure 
1 ^ shows the difference between the expected measurement and the input charge asymmetries, 
or the bias, for each model. The bias for the extended POWHEG models is negligible. The bias 
of the method when applied to samples produced using the SM-based generators MadGraph 
and MC@NLO is compatible with the systematic uncertainty in assigned to model-related 
sources, represented by the shaded band in the plot. Model-related systematic uncertainty 
sources consist of simulation statistics, modeling of tf production, PDFs, and renormalization 
and factorization scales. Similar calibrations of the beyond-SM alternatives of tf production 
considered in this study all show biases statistically compatible with zero. 

5.2 Systematic uncertainties 

Systematic uncertainties in cc are investigated after the statistical combination of the two chan¬ 
nels by repeating the measurement with variations in the parameters or the distributions. The 
second stage of the fit is repeated with sample composition parameters varied independently 
to the upper and lower bounds of their 68% confidence intervals. Parameters for the integrated 
luminosity and the St and DY cross sections are varied similarly, but both fit stages are re¬ 
peated. The effects of statistical uncertainty in the sideband distributions of the data and the 
simulations are investigated with ensembles of alternative templates, generated by varying the 
originals according to Poisson statistics. Uncertainty in the jet energy scale and jet energy res¬ 
olutions are investigated by repeating the reconstruction using rescaled jet energies, according 
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5 Measurement procedure 
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Figure 9: The bias in the measured charge asymmetry for SM simulations and alternative 
tt models, based on extended POWHEG SM templates, versus the charge asymmetry in each 
sample. The beyond-SM samples are MadGraph simulations of Z' bosons and axigluons 
with masses of 200 GeV and 2 TeV. Uncertainty in the bias of the extended POWHEG model is 
dominated by the number of pseudo-experiments used, while the uncertainty in the bias of 
each alternative model is dominated by the statistical uncertainty in the sample. The hatched 
area shows the systematic uncertainty in the measurement of assigned to the modeling. 

to the Pt and t] of each jet. Likewise, the modeling of the b tagging discriminator is varied by 
repeating the reconstruction with scaled discriminant values. The PDFs are varied by event 
reweighting of the tt templates to the 90% confidence limits of each of the 26 GTIO eigenvec¬ 
tors and the strong coupling parameter, independently; this method is more conservative than 
the widely used PDF4LHG prescription |45l since it is sensitive to the possibility of a strong 
correlation between the antisymmetric component of the distribution and any eigenvector, 
while varying the distribution to the minimum and maximum of the uncertainty envelope is 
not. Uncertainty from the modeling of tf production is estimated by measuring the data us¬ 
ing extended MC@NLO templates rather than the extended POWHEG templates, and varying 
the top quark mass by ±0.9 GeV. The factorization and renormalization scales are varied by 
substituting distinct samples for the tf templates, described in Section 4.1 The heavy-flavor 
content of Wj events is varied by adding or subtracting 20% of the expected contribution of a 
distinct W+bb sample to the expected Wj templates. Variations in distributions for the pileup 
multiplicity and the top quark pj, and variations in the trigger and identification efficiencies 
for the charged leptons, are accomplished by event reweighting. The uncertainty in the shape 
of the Mj templates is dominated by the statistical uncertainty in the data sidebands; the Mj an¬ 
tisymmetric components are statistically compatible with zero asymmetry, and no additional 
shape systematic is included beyond that of the statistical shape uncertainty 
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The magnitudes of the systematic uncertainties are given in Table The total systematic un¬ 
certainty of 0.33% is comparable to the statistical uncertainty in the measurement, and is dom¬ 
inated by the statistical uncertainty in the shapes of the data sidebands. 

Table 2: Uncertainty in the combined measurement of from systematic sources, ordered by 
decreasing magnitude. 


(%) 

Source of systematic uncertainty in A? 

0.18 

Data sideband statistical uncertainty 

0.15 

Simulation statistical uncertainty 

0.14 

Jet energy scale 

0.14 

Renormalization and factorization scales 

0.073 

Modeling of b tagging 

0.037 

asi (Gt -6 p-t) 

0.035 

Jet energy resolution 

0.026 

Modeling of pileup 

0.023 

Wbb content 

0.021 

Ratio of St cross sections, Gt/ 

0.021 

Modeling of tt production 

0.018 

PDFs 

<0.010 

ctdy, ^wj, trigger F^y 4 , as 

<0.001 

Trigger Ce, IDe, ID^, 

0.33 

Total 


6 Results 

The measured sample composition is presented in Table Figure [^shows the data from each 

Table 3: Results from the fit of the sample composition, in thousands of events, for the e+jets 
and ji+iets channels. The statistical uncertainty in the last digit is indicated in parentheses. The 
results of the simultaneous fit in both channels are included only for comparison and are not 
used in the measurement of A^. 


Thousands of events 


tf 

Wj 

Mj 

St 

DY 

Total 

Observed 

e only 

207.1(8) 

49.1(9) 

50(1) 

14.0 

5.4 

326(2) 

326.185 

ji only 

242.5(8) 

58.9(6) 

18.7(5) 

16.5 

4.3 

341(1) 

340.911 

Simultaneous fit e 

207.1(5) 

49.5(4) 

50.2(6) 

14.0 

5.4 

326.2(9) 

326.185 

F 

242.6(6) 

58.8(5) 

18.7(5) 

16.5 

4.3 

340.9(9) 

340.911 


channel projected along and A, overlaid with the results of the fitted model. 


Curves of the negative logarithm of the likelihood for both channels are shown in Fig. 11 
along with the combined 68% confidence interval for A^. The predictions of POWHEG, Kuhn 
and Rodrigo | 8 J, and Bernreuther and Si [9l are also plotted. Subfigures of Fig. show the 
range of the antisymmetric components covered by the models at ±1 standard deviation of the 
statistical uncertainty. The combined charge asymmetry using both channels is A^ = [0.33 ± 
0.26 (stat) ± 0.33 (syst)]%, which is tabulated with the predictions in Table The combined 
uncertainty is 0.42%. 
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6 Results 
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Figure 10: Sample composition is measured using the discriminant A distribution (top), in a 
model with contributions from tf, Wj, Mj, and St + DY. With the sample composition subse¬ 
quently fixed, the amplitude of the antisymmetric tt contribution is measured in the distri¬ 
bution, shown decomposed into symmetric (middle) and antisymmetric (bottom) components. 
The thick line shows the antisymmetric component of the fit model. The measurements are 
performed independently on the (left) e+jets and (right) f<+jets samples. 
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Figure 11: At top, the negative logarithm of the likelihood is shown as a function of a (upper 
axis) and (lower axis), for e+jets (closed circles) and }i+iets (open circles) measurements. The 
statistical uncertainty in each is given by the intersections of the parabolas with — log L = 0.5, 
which are marked by arrows. The 68% confidence interval of the combined measurement 
is compared with those of the SM predictions by POWHEG, Kiihn and Rodrigo [I8|, and Bern¬ 
reuther and Si |l9f. At bottom, the antisymmetric component of the distributions in data 
and the model are shown for (left) e+jets and (right) f/+jets, for the central value (solid), and for 
the upper (dashed) and lower (dotted) limits of the 68% statistical confidence intervals. 
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7 Summary 


Table 4: Comparison of charge asymmetry measurements and predictions. 


Source 

Al{%) 

e+jets 

0.09 ±0.34 (stat) 

/^+jets 

0.68 ± 0.41 (stat) 

Combined 

0.33 ± 0.26 (stat) ± 0.33 (syst) 

POWHEG CTIO 

0.56 ± 0.09 

MC@NLO 

0.53 ± 0.09 

Kiihn and Rodrigo |8J 

1.02 ±0.05 

Bernreuther and Si 191 

1.11 ±0.04 


The measured tf production charge asymmetry is compatible with another CMS -\/s = 8 TeV 
measurement liT^ , which uses an unfolding technique on the same data, and with the most re¬ 
cent Monte Carlo predictions and theoretical calculations. The template method incorporates 
more information from the model than used in comparable unfolding techniques IITStiT^ by 
using the distribution of the antisymmetric component of the probability density. This extra in¬ 
formation carries the benefit of reduced statistical uncertainty, at the expense of greater model 
dependence, reflected in the systematic uncertainty. The contributions to the uncertainty from 
statistical and systematic sources are comparable in size. Since the systematic uncertainty is 
dominated by the statistical uncertainty in the templates, it can be reduced in future analy¬ 
ses through increased numbers of events in the simulation and in the sidebands in the data. 
The uncertainty in the POWHEG prediction arises from systematic uncertainties in the PDFs, 
the renormalization and factorization scales, and the strong coupling constant. A graphical 
comparison of the results and predictions is shown in Fig. 


7 Summary 

The forward-central tt charge asymmetry in proton-proton collisions at 8 TeV center-of-mass 
energy has been measured using lepton+jets events from data corresponding to an integrated 
luminosity of 19.6 fb^^. Novel techniques in top quark reconstruction and background dis¬ 
crimination have been employed, which are likely to be of interest in future analyses. The 
measurement utilizes a template technique based on a parametrization of the SM. The result, 
Ac = [0.33 ± 0.26 (stat) ± 0.33 (syst)]%, is the most precise to date. It is consistent with SM 
predictions, but does not rule out the alternative models considered. 
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